Group-delay-deviation based spectral analysis of speech
نویسندگان
چکیده
In this paper, we investigate a new method for extracting useful information from the group delay spectrum of speech. The group delay spectrum is often poorly behaved and noisy. In the literature, various methods have been proposed to address this problem. However, to make the group delay a more tractable function, these methods have typically relied upon some modification of the underlying speech signal. The method proposed in this paper does not require such modifications. To accomplish this, we investigate a new function derived from the group delay spectrum, namely the group delay deviation.We use it for both narrowband analysis and wideband analysis of speech and show that this function exhibits meaningful formant and pitch information.
منابع مشابه
Modified Group Delay Based MultiPitch Estimation in Co-Channel Speech
Phase processing has been replaced by group delay processing for the extraction of source and system parameters from speech. Group delay functions are ill-behaved when the transfer function has zeros that are close to unit circle in the z-domain. The modified group delay function addresses this problem and has been successfully used for formant and monopitch estimation. In this paper, modified ...
متن کاملSpeech recognition using long-term phase information
Current speech recognition systems use mainly amplitude spectrum-based features such as MFFC for acoustic feature parameters, while discarding phase spectral information. The results of perceptual experiments, however, suggested that phase spectral information based on long-term analysis includes certain linguistic information. In this paper, we propose the use of phase features based on long-t...
متن کاملExcitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation
A group delay-based excitation source analysis and design method is introduced for extension of TANDEM-STRAIGHT, a speech analysis, modification and synthesis system. This introduction makes all components of the system be based on interference-free representations. They are power spectrum, instantaneous frequency and group delay representations. This unification has potential to solve the majo...
متن کاملOn the Use of Phase Information in Speech Recognition
This study addresses the use of short−time phase spectra in automatic speech recognition (ASR). Two recent studies have proposed two group delay based spectral representations. Here we propose three new group delay based representations and compare usefulness of all these representations in an ASR experiment. We show that two of the representations we propose perform better, contain equivalent ...
متن کاملSignificance of Joint Features Derived from the Modified Group Delay Function in Speech Processing
This paper investigates the significance of combining cepstral features derived from the modified group delay function and from the short-time spectral magnitude like the MFCC. The conventional group delay function fails to capture the resonant structure and the dynamic range of the speech spectrum primarily due to pitch periodicity effects. The group delay function is modified to suppress thes...
متن کامل